Back

Advanced Science

Wiley

Preprints posted in the last 7 days, ranked by how well they match Advanced Science's content profile, based on 249 papers previously published here. The average preprint has a 0.77% match score for this journal, so anything above that is already an above-average fit.

1
Multiplexed temporal SWCNT biosensor combined with convolutional autoencoding identifies ALS-specific serum protein corona signatures

Sirtori, R.; Lopez, R. M.; Li, H.; Liu, C.; Fisk, N.; Roxbury, D. E.; Fallini, C.

2026-06-08 neurology 10.64898/2026.06.08.26354966 medRxiv
Top 0.8%
12.7%
Show abstract

Amyotrophic lateral sclerosis (ALS) lacks a validated blood-based diagnostic, and the field is increasingly moving from single-molecule markers toward integrative, multi-component signatures. Here we present a liquid-biopsy strategy that transduces disease dependent serum-nanoparticle interactions into a learnable near-infrared spectral phenotype. A sensor array of twelve DNA-functionalized single-walled carbon nanotube (SWCNT) chiralities, functionalized with (GT)6 ssDNA coupled with a deep learning model was tested on serum from 20 ALS patients and 19 age- and sex-matched controls (n = 39, TargetALS). Our multiplexed sensor design (12 SWCNT chiralities) and data acquisition strategy based on excitation-emission matrices acquired at three timepoints (0, 6, 24 h) was conceived to maximize sensor carried information. Indeed, we show that the array generates partially independent temporal dynamics across chiralities governed primarily by tube diameter. To decode this multiplexed, time-resolved signal, we trained a dual-objective convolutional autoencoder that jointly optimizes reconstruction and classification, achieving 84.6% cross-validated accuracy (AUC = 0.87). Selected latent features were reproducible across an independent same-subject experimental batch and correlated with serum neurofilament light chain, linking the spectral phenotype to a clinically relevant neurodegeneration marker. Mass spectrometry supported a molecular basis for discrimination, revealing an ALS-biased protein corona enriched in adaptive-immune and inflammatory proteins. Together, these results establish proof of principle that time-resolved, multi-chirality SWCNT spectral sensing can compress complex serum composition into a reproducible near-infrared biomarker signature for ALS.

2
A mechanistic model for genetic regulation of postmenopausal bone loss

Rattsev, I.; Mac Gabhann, F.; Hertz, D.; Taylor, C. O.

2026-06-08 endocrinology 10.64898/2026.06.04.26354968 medRxiv
Top 2%
8.5%
Show abstract

Bone remodeling is a tightly regulated physiological process that maintains bone health through coordinated action of bone-resorbing osteoclasts and bone-forming osteoblasts. Disruption of this balance, such as the one induced by estrogen decline after menopause, results in bone loss and osteoporosis. Genetic factors play an important role in determining bone mineral density (BMD) loss over time. However, translating genetic associations into individualized risk prediction remains challenging due to small effect size of individuals variants and non-linear interactions within the bone remodeling unit. Here, we present a bone cell population dynamics model that includes major regulatory pathways, such as the RANK/RANKL/OPG axis, Wnt signaling, and hormonal regulation by estrogen, parathyroid hormone, and TGF-{beta}. We calibrate the model on clinical data from healthy postmenopausal women, and women with reduced BMD undergoing anti-osteoporotic therapy. The calibrated model captures healthy BMD decline in postmenopausal women and therapeutic response to anti-osteoporotic medications. We mechanistically incorporate the effect of 22 variants across 8 genes involved in bone remodeling and simulate BMD trajectories in 1,000 virtual subjects differing by ancestry and genetic makeup. The median predicted 5-year BMD loss was 3.57% (95% prediction interval: 1.31-5.24), consistent with the values reported in the literature. The virtual individuals with African ancestry were predicted to experience the highest average 5-year BMD loss. The strongest genetic risk factors for bone loss were predicted to be CYP19A1 rs727479 and OPG rs3102735, while LRP5 rs11228240 emerged as a protective factor that could partially counteract the detrimental effects of other variants. Several epistatic effects were observed in the genetic interaction analysis. Mechanistically, our model suggested that estrogen exerts its effect on bone remodeling primarily by modulating osteoclast apoptosis. Overall, this framework demonstrates a proof-of-concept for integration of genetic risk factors into mechanistic models of disease and can be extended to other conditions with polygenic inheritance.

3
Order-Based Bayesian Network Modeling of Early Detection and Post-Diagnosis Control for Cardiovascular Disease Risk in Type 2 Diabetes

Kathuria, Y.; Miller, K.; Selden, E. B.; Gallagher, W. J.; Capan, M.

2026-06-12 primary care research 10.64898/2026.06.10.26355419 medRxiv
Top 2%
6.9%
Show abstract

Patients diagnosed with type 2 diabetes (T2D) are at increased risk of developing cardiovascular disease (CVD), the leading cause of morbidity and mortality in this population. Early detection and glycemic control within the first year after diagnosis reduce CVD risk. However, gaps remain in how to operationalize early detection of T2D using Electronic Health Record (EHR) data and quantify its relationship with subsequent CVD risk using longitudinal observations. We developed a probabilistic graph model to analyze the interdependencies between early detection of T2D, post-diagnosis glycemic control, and CVD occurrence. Using a temporally structured Bayesian Network (BN) learned from EHR data of 9,450 primary care patients between 2017 and 2023, we quantified probabilistic dependencies between demographics, diagnostic delay surrogates, glycemic control, and post-diagnosis CVD occurrence. Percentile based thresholds defined risk groups, where individuals with predicted probabilities in the bottom decile ([≤] 10th percentile) were classified as low risk, and those in the top decile ([≥] 90th percentile) as high risk. Results demonstrated heterogeneity in predicted risks across glycemic and cardiovascular outcomes. Predicted probability of developing CVD within the first year after T2D diagnosis ranged from a mean of 5.2% in the low-risk group to 28.9% in the high-risk group, while predicted probabilities of mean Hemoglobin A1c (HbA1c) [≥] 8% during the first year post-diagnosis ranged from 1.6% in low-risk to 55.1% in high-risk group. Patients with HbA1c at diagnosis [≥] 8% had higher predicted probabilities of first-year post-diagnosis mean HbA1c [≥] 8% (53.3% vs. 1.9%) and high HbA1c coefficient of variation (18.7% vs. 3.1%) compared with those with HbA1c [≤] 6.5%. Incorporating early clinical outcomes refined later risk predictions, with long-term CVD risk reaching 33.5% among high-risk individuals. The proposed model achieved predictive performance comparable to conventional machine learning approaches while providing interpretable relationships for risk stratification in primary care populations.

4
Context-dependent molecular responses to heterogeneous metabolic disease traits

Michalettou, T.-D.; Vinuela, A.

2026-06-08 endocrinology 10.64898/2026.05.31.26354544 medRxiv
Top 3%
6.1%
Show abstract

Metabolic diseases such as type 2 diabetes (T2D) arise through complex interactions between physiological, molecular, and environmental processes. Clinical traits including age, sex, adiposity, and glycaemic status are strongly associated with disease risk and progression, yet most molecular studies examine these factors independently and assume relatively static molecular regulation. Consequently, how physiological state dynamically reshapes molecular organisation across omics layers remains poorly understood. Here, we integrated transcriptomic, proteomic, metabolomic, and genetic data from 3,027 individuals in the IMI DIRECT cohort to characterise the joint molecular effects of age, sex, body mass index (BMI), and glycated haemoglobin (HbA1c). We identified widespread associations between these traits and molecular phenotypes. However, interaction analyses revealed a more complex context-dependent regulation, showing that the molecular effect of one trait frequently depends on the state of another, with sex-specific effects of age being more prominent. We also investigated relationships between different types of molecular phenotypes and how these relationships are modulated by metabolic disease relevant traits, demonstrating that cross-omic molecular coordination is itself dynamically remodelled by physiological and metabolic state. Probabilistic causal inference identified a directionally structured network of age-associated molecules, revealing pathways through which age effects propagate across omics layers, showcased in the example of the mTOR signalling pathway. Integration of this directed network with genetic colocalisation analyses also identified a sub-network relevant for T2D. Collectively, our findings demonstrate that metabolic disease relevant traits not only independently influence molecular phenotype abundance but also jointly reshape the directional organisation of cross-omic molecular networks. These results support a model in which metabolic disease susceptibility emerges through dynamic rewiring of interconnected molecular systems and provide a framework for context-dependent biomarker discovery, disease stratification, and precision metabolic medicine.

5
PhysiCase: Development and dual-layer validation of synthetic cases for health professional education: A pilot study leveraging Generative AI

Komolafe, O. O.; Roberts, A. C.; Shelley, J.; Tawiah, A. K.

2026-06-09 rehabilitation medicine and physical therapy 10.64898/2026.06.07.26355114 medRxiv
Top 4%
4.1%
Show abstract

High-quality, domain-specific datasets are foundational to advancing educational tools and AI systems in healthcare, yet assembling case repositories from real-world clinical records faces substantial privacy, ethical, and licensing barriers. Synthetic data generation offers a compelling pathway forward, but educational cases require rigorous validation to ensure clinical plausibility and pedagogical utility. This pilot study introduces PhysiCase, a dual-layer validation pipeline for synthetic case generation and evaluates the feasibility of combining automated LLM-based screening with expert educator review. We generated 128 synthetic musculoskeletal(MSK) cases using four frontier large language models (GPT-4.1, GPT-4o, Google Gemini 2.5 Pro, and Llama 4 Scout) across 28 clinical conditions. Cases underwent automated quality screening using an "LLM-as-judge" framework (DeepEval) assessing prompt alignment, JSON correctness, answer relevance, bias, toxicity, and completeness. Ninety cases (70.3%) passed automated filtering and proceeded to expert evaluation by four MSK physiotherapy educators, who rated medical accuracy, realism, fidelity, relevance, and usability on 5-point Likert scales. GPT-4.1 demonstrated the highest automated pass rate (96\%) and strongest expert ratings (medical accuracy 4.10/5, usability 4.38/5), while Llama 4 Scout showed the lowest pass rate (33.3%) and expert ratings. Expert-evaluated cases achieved strong content validity indices for usability (97.5%), relevance (97.5%), and realism (95%), though medical accuracy showed greater variance (CVI 87.5%). Cross-layer correlation analysis revealed that automated completeness metrics moderately aligned with expert usability ratings , while answer relevance and prompt alignment showed weak or negative correlations with clinical correctness. Qualitative analysis identified three primary failure modes: reductive logic, biomechanical inconsistency, and administrative/contextual gaps. The dual-layer validation framework proved methodologically viable: automated screening efficiently reduced expert review burden, while human judgment remained indispensable for detecting subtle clinical reasoning failures. LLM-generated synthetic cases has the potential to meet practical educational needs for MSK physiotherapy, but expert validation is essential to safeguard clinical accuracy. These findings support a scalable division of labour for synthetic case development, with targeted improvements to prompting and automated reasoning checks needed to address identified "nuance gaps." The code for this paper is available on https://github.com/kwid-ai/PhysiCase

6
Topological Deep Learning Identifies Polygenic Variant Clusters Across Familial Multimorbid Disorders

Vomo-Donfack, K. L.; Bousquet, G.; Falgarone, G.; Ginot, G.; Morilla, I.

2026-06-09 health informatics 10.64898/2026.06.03.26354242 medRxiv
Top 5%
3.6%
Show abstract

Whole-genome sequencing comprehensively captures coding, non-coding and structural variation in families with suspected inherited disorders, yet its clinical utility remains constrained by an interpretation bottleneck: selecting a handful of relevant variants from millions of candidates. Current rule-based pipelines, anchored in ACMG/AMP criteria, excel at identifying highly penetrant Mendelian alleles but frequently miss variants of low-to-moderate penetrance, non-coding alterations and germline-somatic interactions. Here we introduce PolyCLIP-T, a topology-guided multimodal framework that transforms variant selection from a classification problem into a geometric discovery task. By contrastively aligning DNA-sequence embeddings with functional annotations, PolyCLIP-T constructs a unified latent space in which the displacement between reference and alternate embeddings quantifies the molecular perturbation induced by each variant. Persistent homology then identifies stable topological components - coherent variant groups shared among affected relatives - that transcend single-variant scoring logic. Applied to six families with multi-morbid cancer, autoimmune and cardiovascular disease, PolyCLIP-T recovered non-coding and structural candidates overlooked by conventional pipelines and revealed pleiotropic networks spanning disease categories. This approach provides an interpretable, scalable solution for genome-first investigations of disorders driven by polygenic architectures that evade single-variant analysis. The framework was developed and benchmarked on deeply characterised familial cohorts selected for transgenerational multimorbidity; validation in larger, independent populations will be essential to establish its generalisability. An interactive web tool is freely available at https://www.polyclip-t.uma.es/.

7
Daily symptom monitoring is sustainable over months: retention, not compliance, is the primary barrier to long-duration digital tracking

Gunsilius, C. Z.; Pei, P.; Carayannopoulos, A.; Petzschner, F. H.

2026-06-10 rehabilitation medicine and physical therapy 10.64898/2026.06.08.26355180 medRxiv
Top 6%
3.6%
Show abstract

Ecological momentary assessment (EMA) enables real-time, longitudinal measurement of symptoms and behavior via smartphones, yet nearly all feasibility evidence comes from protocols lasting one to two weeks, far shorter than the timescales over which chronic diseases fluctuate and clinical decisions unfold. Whether daily compliance can be sustained over months, or whether it decays as short-protocol trends predict, is unknown. Here, 214 participants (173 with pain, 41 healthy controls) completed a 4-month (122-day) EMA protocol via the Soma smartphone app, generating 26,907 check-ins. Half the sample completed the full protocol without a two-week lapse. Aggregate compliance appeared moderate (50%), but this conflated two distinct phenomena: when recomputed over each participant's active period, compliance rose to 71%, with 91% achieving moderate-to-high adherence, and remained stable across all 17 study weeks. Pain status predicted earlier disengagement but not lower compliance among those who remained; after adjustment for differential retention, group differences disappeared. To our knowledge, this is the longest continuous daily EMA evaluation in a clinical population. It suggests the primary barrier to long-duration EMA is not declining motivation among active participants but concentrated early disengagement, with direct implications for the design of digital health protocols, decentralized trials, and remote symptom monitoring.

8
Room-Specialized Mixture-of-Experts for In-Home ADL Recognition with Ambient Sensors

Addepalli, V. r.; Rao, P.; Kiselica, A.; Kummerfeld, E.; Abdalnabi, N.; Lee, K.

2026-06-12 health informatics 10.64898/2026.06.10.26355390 medRxiv
Top 6%
3.5%
Show abstract

Monitoring activities of daily living (ADLs) in the home is a promising approach for tracking dementia progression in older adults. While ambient sensor-based ADL systems are well-studied, most existing ADL recognition systems rely on globally trained models that ignore the spatial organization of in-home activities. In real deployments, where training data are sparse and highly home-specific, global transformer models may fail to capture room-dependent behavioral structure. We propose a deterministic Mixture of Experts (MoE) architecture for in-home ADL recognition, in which each expert is a compact transformer specialized to one room of the home (bedroom, kitchen, bathroom, living area). Input segments are routed using a deterministic gating strategy based on room-level motion activity and time-of-day priors for sleep-related behaviors. Unlike learned routing networks, the proposed gate encodes domain knowledge about where ADLs are likely to occur, reducing model complexity under limited per-home training data. By decomposing ADL recognition into room-specific activity spaces, the proposed architecture reduces competition between dominant and low-frequency activities under highly imbalanced residential data. We evaluated the system on data collected via low-cost ambient sensors (motion, light, temperature, humidity) and Raspberry Pi edge devices across five homes, with ground-truth ADL labels provided by participants and caregivers. Across the five homes, the proposed MoE consistently outperformed global transformer, 1D CNN, and Random Forest baselines, achieving macro-F1 scores ranging from 0.60 to 0.88, highlighting the importance of home-specific modeling in real-world deployments. These findings suggest that room-aware expert specialization may provide a practical and interpretable strategy for low-data ADL recognition in real-world residential environments.

9
BodyMAE: A Surface-Area Aware Masked Autoencoder for Body Composition Estimation from 3D Body Scans

Zheng, Y.; Feng, B.; Cheng, R.; Qiu, C.; Long, Z.; Vaziri, K.; Hahn, J.

2026-06-06 health informatics 10.64898/2026.06.04.26354925 medRxiv
Top 6%
3.5%
Show abstract

Accurate assessment of body composition is important to risk stratification and management of metabolic, musculoskeletal, and aging-related diseases, yet reference modalities such as Dual-energy X-ray absorptiometry (DXA) are costly and impractical for frequent monitoring. Commodity 3D body scans offer a low-cost, radiation-free alternative, but extracting meaningful and predictive shape features from scans remains challenging due to nonuniform point density, variable body size and cross-device differences. We introduce BodyMAE, a self-supervised, surface-area aware masked autoencoder for metric-scale 3D body scans. The pipeline integrates area-adjusted sampling, a long-range focused encoder, and a lightweight decoder regularized to promote locally uniform reconstructions. Trained and evaluated on 917 paired 3D body scans paired with clinical DXA reports, BodyMAE achieves strong accuracy on fat percentage (root-mean-square error (RMSE) 3.825 percentage points, R^2 0.908), fat mass (RMSE 3.694 kg, R^2 0.968), and lean mass (RMSE 3.608 kg, R^2 0.901), with competitive performance on bone mineral content (RMSE 0.284 kg, R^2 0.754).We also assess feature stability across pretrained baselines, finding higher retrieval accuracy for our representations (Top-1 90.131%). These results indicate that combining metric-aware sampling, long-range relational encoding, and local geometric regularization enables accurate body composition estimation from 3D body scans, as validated by comparisons to DXA-derived measurements.

10
Transcriptomic Architecture of Type 2 Diabetes in Human Pancreatic Islets:An Integrative Meta-Analysis and Machine Learning Framework for Biomarker Discovery

Romero, R.

2026-06-10 endocrinology 10.64898/2026.06.08.26355184 medRxiv
Top 6%
3.2%
Show abstract

Background. Type 2 diabetes mellitus (T2D) is defined by progressive pancreatic {beta}-cell dysfunction whose molecular underpinnings remain incompletely understood. Single-cohort transcriptomic analyses of donor islets have yielded heterogeneous gene lists of limited cross-study reproducibility, constraining both mechanistic interpretation and biomarker development. Methods. We combined two complementary analytical strategies applied to four public human islet transcriptomic cohorts (GSE25724, GSE20966, GSE38642, and GSE164416; n = 7-57 donors per contrast). For the integrative arm, three microarray datasets and one bulk RNA-seq dataset were processed independently and unified through gene-level random-effects meta-analysis, hallmark pathway scoring (GSVA/MSigDB), and iterative module refinement, yielding a two-axis disease framework. For the diagnostic arm, a consensus multi-method machine learning pipeline, combining LASSO penalized logistic regression, Support Vector Machine Recursive Feature Elimination (SVM-RFE), and Random Forest importance scoring, was applied to 184 differentially expressed genes from the RNA-seq cohort, with all normalization steps performed within leave-one-out cross-validation (LOOCV) folds to prevent data leakage. Machine learning classification of the RNA-seq cohort was additionally subjected to external transportability testing in the independent bulk human islet RNA-seq cohort GSE50244 using an overlap-restricted reduced score and a threshold fixed in the discovery cohort. Results. Meta-analysis across all four cohorts identified 337 high-confidence T2D-associated genes (96.1% directional concordance in beta-cell-enriched tissue). These were distilled into two refined 14-gene modules: ImmuneStress (MICB, HLA-DRA, HLA-DPA1, IL1R2, and others) and BetaCellIdentitySecretion (RASGRP1, PPP1R1A, SLC2A2, and others), whose composite IsletDysfunctionScore provided the most stable cross-platform separation of non-diabetic from T2D islets (Hedges' g = 1.80, p = 9.83 x $10^-17$, $\text{I}^2$= 0%). Consistent with progressive disease, IsletDysfunctionScore increased monotonically from non-diabetic to impaired glucose tolerance to T2D. Separately, the machine learning pipeline derived a 10-gene diagnostic panel: GABRA2, SLC2A2, ARG2, DKK3, PRIMA1, TAFA4, HHATL, PARVG, RNU1-70P, and the novel lncRNA ENSG00000284653, that achieved perfect discrimination in LOOCV (AUC = 1.000, sensitivity = 1.000, specificity = 1.000, zero misclassifications across all 57 donors). A leakage-verification experiment confirmed that this performance reflected genuine biological signal: global quantile normalization prior to cross-validation collapsed AUC to 0.380. External testing showed that 8 of the 10 panel genes were measurable in GSE50244. The frozen 8-gene reduced score retained strong discrimination (external AUC = 0.907), with 6 of 8 genes preserving directional concordance, but the discovery-derived threshold did not transfer because the external score distribution was shifted upward and compressed, yielding complete sensitivity but zero specificity at the frozen cutoff Conclusions. Integrating pathway-level meta-analysis with machine learning classification, we present a coherent two-axis model: immune/stress activation and loss of beta-cell identity/secretory competence, together with a compact, biologically interpretable 10-gene diagnostic signature. Panel genes converge on GABA signaling, glucose transport, arginine metabolism, WNT pathway inhibition, and a novel lncRNA, providing both mechanistic hypotheses and high-priority targets for external validation. These findings offer a reproducible transcriptomic scaffold for future mechanistic, biomarker, and clinical translation studies of human islet dysfunction. They also support external transportability of the core biological signal, while indicating that absolute operating thresholds are cohort-dependent and would require recalibration before deployment in independent datasets.

11
STDP-inspired temporal transition modeling for adaptive clinical risk prediction from electronic health records

Gong, L.; Aswani, N.; Shahinian, P.; Yang, J. Y.; Kontos, D.; Manji, G.; Kang, S.; Hur, C.

2026-06-09 health policy 10.64898/2026.06.04.26354919 medRxiv
Top 7%
2.9%
Show abstract

Electronic health record (EHR) prediction models often summarize longitudinal histories as static patient-level features, which may omit potentially informative event ordering. We developed a simplified spike-timing-dependent plasticity (STDP)-inspired framework that represents asynchronous EHR data as sparse, directional transition features. The approach encodes whether one clinical event precedes another within prespecified temporal windows, preserving event identity, directionality, and approximate timing while retaining feature-level interpretability. We evaluated this framework in two retrospective prediction tasks with different temporal scales: incident acute kidney injury (AKI) prediction in 17,351 MIMIC-IV ICU stays and early postoperative recurrence prediction in 713 CUMC patients with pancreatic ductal adenocarcinoma (PDAC). Models were compared with static burden features (demographics, comorbidities, raw lab measurements) and in addition with STDP transitional feature sets using patient-level cross-validation and rolling prediction horizons. In AKI, a calibrated STDP ensemble model showed higher discrimination than static burden alone at the 24-hour decision snapshot for AKI by 72 hours, with AUROC 0.838 versus 0.800, and at 48 hours for near-term AKI prediction, with AUROC 0.868 versus 0.827. In PDAC, STDP transition features modestly improved Day -30 preoperative recurrence prediction, with AUROC 0.611 versus 0.587 and AUPRC 0.323 versus 0.318 for static burden and showed similar performance at Day 0 (7 days before recorded surgery date), with AUROC 0.681 and AUPRC 0.363. Decision-curve and feature analyses suggested that selected temporal transitions were clinically interpretable across renal, inflammatory, hepatobiliary, hematologic, glycemic, and nutritional trajectories. These findings suggest that STDP-inspired transition features may provide a practical, interpretable way to incorporate temporal ordering into EHR-based risk prediction across both acute and longitudinal settings

12
Towards the Virtual Amyotrophic Lateral Sclerosis Patient: Inferring Cortical Excitability through Whole-Brain Dynamical Modeling

Angiolelli, M.; Demuru, M.; Lopez, E. T.; Hashemi, M.; Ziaeemeh, A.; Rabuffo, G.; Trojsi, F.; Granata, C.; Tafuri, D.; De Luca, M.; Gallo, E.; Jirsa, V.; Depannemaecker, D.; Sorrentino, P.

2026-06-10 neurology 10.64898/2026.06.09.26354829 medRxiv
Top 7%
2.8%
Show abstract

Amyotrophic lateral sclerosis (ALS) is increasingly recognized as a multisystem neurodegenerative disorder in which motor-neuron degeneration is accompanied by widespread alterations in cortical dynamics. Among its most reproducible neurophysiological signatures is cortical hyperexcitability, yet how this local excitability imbalance shapes distributed whole-brain activity remains poorly understood. Here, we combined source-reconstructed resting-state MEG data, tractography-informed whole-brain modeling, and simulation-based inference to investigate whether ALS-related alterations in large-scale brain dynamics can be mechanistically explained by changes in cortical excitability. First, we characterized empirical brain dynamics using complementary features spanning regional activity amplitude and variability, functional connectivity, and avalanche-based metrics. These analyses revealed significant alterations in ALS patients relative to healthy controls, as well as associations with clinical impairment and disease staging. To mechanistically interpret these changes, we employed a reduced Wong-Wang whole-brain model in which local recurrent excitation modulates emergent large-scale neural dynamics. Simulations showed that increasing excitability systematically reproduced the empirical dynamical signatures observed in ALS. We then applied a simulation-based inference framework to estimate latent excitability parameters directly from empirical observations. Whole-brain model inversion revealed increased excitability in ALS patients compared with controls. The recovered excitability parameter was associated with disease staging, supporting its clinical relevance as a model-derived descriptor of ALS progression. Finally, by extending the model to estimate frontal and non-frontal excitability separately, we found that ALS-related alterations were predominantly associated with increased frontal excitability, whereas non-frontal regions appeared comparatively less affected. The recovered parameters related to disease staging. Together, these findings provide a mechanistic framework linking altered large-scale brain dynamics in ALS to selective cortical hyperexcitability, explaining how local excitability changes can give rise to global network reorganization. More broadly, they show how computational model inversion can recover latent multiscale pathophysiological processes from empirical neural recordings, offering a non-perturbative alternative to complex experimental paradigms typically required to causally probe local-to-global mechanisms.

13
Aperiodic and oscillatory activity of the human brain during induced emotional states

Park, H.; Hacker, C.; Cho, H.; Xie, T.; Simmons, A.; Tan, G.; Leuthardt, E. C.; Brunner, P.; Willie, J.

2026-06-09 neurology 10.64898/2026.06.02.26354146 medRxiv
Top 7%
2.7%
Show abstract

Normal emotional experience depends on dynamic modulation of neural excitability across limbic and prefrontal circuits, yet the spectral markers that reflect these shifts in humans remain incompletely understood. In this study, we combined a validated video-based emotion induction paradigm with stereotactic electroencephalography (SEEG) in 31 patients with drug-resistant epilepsy to investigate how positive and negative affective states modulate oscillatory and aperiodic (asynchronous) neural activity. Using spectral parameterization to dissociate oscillatory power from the aperiodic 1/f component, we found that emotional valence robustly altered the aperiodic slope in a regionally specific manner: negative valence flattened the slope in thalamus, posterior insula, and posterior cingulate cortex, whereas positive valence produced flattening in dorsolateral prefrontal cortex. Simultaneous oscillatory changes included increased high-frequency activity and decreased alpha/beta power during negative affect, and reduced alpha power during positive affect, which were elucidated after adjusting for broadband aperiodic spectral shifts. These effects persisted after controlling for audiovisual stimulus or physiological features and were not evident in simultaneously recorded scalp EEG, underscoring their localization to intracranial sites. Together, these results provide the first direct evidence that active induction of emotional states modulates the aperiodic slope of human intracranial field potentials, reflecting valence-dependent shifts in local circuit excitability. The findings highlight the 1/f slope as a sensitive neural marker of affective brain states and for mood dysregulation.

14
ECG-derived age deviation predicts cardiovascular diseases across lead configurations and cohorts

Aydogdu, D.; Gaber, F.; Sorooshmehr, A.; Akalin, A.

2026-06-08 cardiovascular medicine 10.64898/2026.06.05.26354974 medRxiv
Top 8%
2.4%
Show abstract

Cardiovascular diseases (CVDs) remain the primary global health burden, motivating the search for robust, non-invasive risk biomarkers. We harness a foundation model pretrained on over 10 million recordings, to evaluate ECG-derived age deviation as a cross-cohort biomarker of CVD burden. A predictive model, trained exclusively on healthy subjects, achieved accurate age prediction. Diseased subjects exhibited significant positive age acceleration across multiple categories, with structural and ischemic heart diseases showing the largest effects. External validation in a hospital-based cohort (n=160,493) confirmed that age acceleration independently predicts all-cause mortality, with the strongest prognostic value in patients under 65 years. Furthermore, we demonstrated that disease discrimination and mortality prediction are preserved across 6-lead and single-lead configurations, supporting potential deployment in wearable or mobile devices. Our analysis also revealed a striking morphological confound from the complete left bundle branch block, leading us to propose absolute age deviation as a more robust, universal risk marker. These findings establish ECG-derived biological age deviation as a highly generalizable and clinically actionable biomarker for assessing cardiovascular risk. We have also developed a web application at https://bioinformatics.mdc-berlin.de/ECGage that allows users to easily test our framework.

15
An integrated proteogenomic investigation of the human liver uncovers molecular drivers of steatotic liver disease

Gobeil, E.; Bourgault, J.; Enault, M.; Cote, V.; Mitchell, P. L.; Ruel, L.-J.; Girard, A. S.; Vohl, M.-C.; Arsenault, B. J.

2026-06-06 endocrinology 10.64898/2026.06.04.26354903 medRxiv
Top 9%
2.0%
Show abstract

Metabolic dysfunction-associated steatotic liver disease (MASLD) is rapidly increasing worldwide, yet effective targeted therapies remain limited. To better understand the molecular mechanisms underlying MASLD, we performed an integrated proteogenomic analysis of human liver tissue. Using mass spectrometry, we quantified 2,744 proteins in 504 liver biopsies from the Quebec Obesity Biobank and examined changes across disease stages. To investigate causality, we integrated liver proteomics with RNA sequencing and genome-wide genotyping to map thousands of protein quantitative trait loci (pQTLs) and expression quantitative trait loci (eQTLs). These molecular data were combined with summary statistics from a meta-analysis of genome-wide association studies including 16,532 MASLD cases and 1,240,188 controls. Mendelian randomization and genetic colocalization analyses revealed that most proteins differentially expressed across MASLD stages were not causally implicated in disease risk, whereas several genetically predicted liver proteins showed evidence of causal effects. Among these, higher hepatic levels of the MTARC1 protein were causally associated with MASLD and hepatic fat accumulation. Phenome-wide analyses suggested that MTARC1 inhibition may reduce the risk of cirrhosis, hepatocellular carcinoma, and cholelithiasis while improving lipid profiles. Notably, the causal MTARC1 variant influenced liver protein levels but not gene expression. Genetic analyses also identified ERLIN1 and HSD17B13 as potential therapeutic targets. In contrast, eQTLs and pQTLs at other loci such as GCKR showed opposite effects on MASLD risk. These findings highlight the importance of integrating tissue proteomics with human genetics to distinguish biomarkers from causal drivers and to identify promising therapeutic targets for MASLD.

16
Subthalamic DBS Engages Right-lateralized Frontal Control to Improve Gait Adaptation in Parkinson's

Hanafi, I.; Pozzi, N. G.; Habib, R.; Falciglia, S.; Del Vecchio Del Vecchio, J.; Remore, L. G.; Marotta, G.; Buck, A.; Pezzoli, G.; Volkmann, J.; Isaias, I. U.; Palmisano, C.

2026-06-09 neurology 10.64898/2026.06.03.26354536 medRxiv
Top 10%
1.7%
Show abstract

Adapting ongoing gait patterns to environmental challenges is essential for safe navigation through the environment. Impairment of gait adaptation is common in many neurodegenerative disorders, such as Parkinson's disease (PD), where it hampers mobility and limits quality of life. The neural control of gait adaptation remains largely unclear, thereby limiting the development of targeted treatments, such as deep brain stimulation of the subthalamic nucleus (STN-DBS). We integrated clinical, kinematic, brain metabolic imaging, and electrophysiological data, obtained during a fully immersive virtual reality overground walking task, to characterize the neural underpinnings of gait adaptation performance during dynamic obstacle avoidance and its improvement with STN-DBS. Movement kinematics, brain oscillatory activity, and metabolic activation were simultaneously acquired in 12 patients with PD during rest and gait adaptation, under active or paused STN-DBS, using inertial measurement units, electroencephalography, and three separate [18F]fluorodeoxyglucose positron emission tomography scans. Eight age-matched healthy subjects completed the same task for comparative kinematic analyses. All patients showed significant clinical improvement with STN-DBS. During the gait adaptation task with paused stimulation, patients exhibited increased metabolic activity in the cerebellum and sensorimotor cortex. Active STN-DBS selectively enhanced thalamic and superior frontal gyrus (SFG) metabolism, while concomitantly reducing cerebellar uptake. Right-lateralized SFG metabolism correlated with gait adaptation performance, with DBS-driven shifts toward greater right SFG activity predicting the magnitude of gait adaptation improvement. This correlation was independent of baseline asymmetry in clinical impairment, electrode placement, or structural connectivity to the SFG. Of note, STN-DBS amplitude asymmetry emerged as an independent predictor of right-lateralization of SFG metabolism. EEG recordings confirmed this lateralized network modulation, with theta-band asymmetry paralleling PET findings. Our findings identify a lateralized thalamo-cortical network supporting gait adaptation in PD and highlight a distinctive role for the SFG. We further show that effective STN-DBS acts as a lateralized regulator, dynamically rebalancing cortico-thalamic circuits to support context-appropriate gait control. The observed right-hemispheric lateralization may foster novel image-guided programming strategies to enhance the consistency and effectiveness of gait control in PD.

17
Stochastic Morphodynamics of the Human Aorta Across the Lifespan

Twohig, K. C.; Mansour, M.; Pugar, J. A.; Yuan, K.; Pocivavsek, L.; Klishin, A. A.

2026-06-08 surgery 10.64898/2026.06.05.26355015 medRxiv
Top 11%
1.7%
Show abstract

Biological systems evolve as continuous dynamical processes, but at organ-scale and across human lifespans they are rarely observed longitudinally--population data typically exist instead as sparse, cross-sectional snapshots. Inferring lifespan dynamics from such data requires methods distinct from those used at cellular and tissue scales where dense observations are accessible. We address this problem in the thoracic aorta, where surgical decisions currently rest on static, age- and sex-agnostic diameter thresholds that reduce three-dimensional morphology to a single scalar. Treating normal aortic morphology as a stochastic dynamical system, we pose a continuous-time drift-diffusion process in a two-coordinate state space of normalized surface area (A) and normalized fluctuation in integrated Gaussian curvature ({delta} K), and fit closed-form solutions of the Fokker-Planck equation by maximum likelihood to a sex-balanced, age-uniform cohort spanning infancy to age 99. Inter-individual variability is treated as a fitted diffusion parameter rather than as residual scatter, which is distinct from prior normative studies that report variability as scatter around a regression line. The framework identifies two growth regimes for aortic size (childhood expansion followed by persistent adult growth, with adult males growing approximately 70% faster than adult females) and a single dynamical regime for aortic shape, with heteroscedastic variability accumulating at a rate comparable to the mean drift over the lifespan. Applied to independent cohorts of acute and chronic thoracic aortic dissections, the multivariate model identifies over 95% as statistical outliers via Mahalanobis distance, consistently outperforming either coordinate alone. The same probabilistic envelope that describes normal aging thus defines a baseline against which disease can be detected, supporting a shift toward dynamic, age- and sex-aware assessment of thoracic aortic pathology.

18
Sensorimotor recovery and neuropathic pain reduction after remotely delivered cognitive multisensory rehabilitation or remotely delivered exercise in adults with spinal cord injury: a pilot clinical trial.

Van de Winckel, A.; Herrmann, A. A.; Carpentier, S. T.; Bottale, S.; Lopez, R. L.; Rapacz, A. D.; Larson, S. J.; Deng, W.; Zhang, L.; Hendrickson, T. J.; Mueller, B. A.; Nourian, R.; Morse, L. R.; Lim, K. O.

2026-06-09 rehabilitation medicine and physical therapy 10.64898/2026.06.02.26354574 medRxiv
Top 12%
1.5%
Show abstract

Introduction: Reduced or lost sensation and movement after a spinal cord injury (SCI) impairs the brain s ability to accurately localize paralyzed body parts, causing deficits in its internal body map, or mental body representations (MBR). These deficits hinder functional recovery and contribute to neuropathic pain. Medications for neuropathic pain are often ineffective and carry side effects. Our pilot trials found that in-person Cognitive Multisensory Rehabilitation (CMR), a physical therapy restoring MBR, led to prolonged pain reduction, improved sensorimotor function, and enhanced brain function, to greater extent than adaptive fitness. To explore more accessible interventions for those in rural areas or with transportation challenges, we examined whether 12 weeks of remotely delivered CMR or exercise would (1) improve function and reduce pain; (2) increase brain activity and connectivity related to sensorimotor function and MBR in adults with SCI. Methods: Of 19 adults with SCI who consented, 15 (51+/-15 years old, 8+/-10 years post-SCI) were randomized to 12 weeks of remotely delivered CMR or exercise (45min, 3x/week). Eight reported neuropathic pain equal or greater than 3/10. The Numeric Pain Rating Scale (NPRS), ASIA Impairment Scale (AIS), and Neuromuscular Recovery Scale (NRS) assessed pain and sensorimotor function at baseline, post-intervention, and 6-month follow-up. Functional MRI included resting-state and four tasks: imagining feeling the left leg, imagining moving the left leg, whole-body movement imagery, and a sensation task. Results: After CMR (n=8), participants improved on AIS (large effect sizes: touch: d=1.30; pinprick: d=1.21; lower limb motor function: d=1.83). Exercise (n=7) produced smaller improvements (touch: d=0.35; pinprick: d=0.36; lower limb motor function: d=0.80). CMR showed greater NRS effect sizes (core: d=1.48; upper limb: d=0.69; lower limb: d=1.25) than exercise (core: d=0.31; upper limb: d=0.74; lower limb: d=0.83). Benefits persisted at follow-up for both AIS and NRS, especially in the CMR group. Highest neuropathic pain intensity decreased in both groups post-intervention (CMR: d=-0.61; exercise: d=-0.73) and at 6-month follow-up (CMR: d=-0.55; exercise: d=-0.55). Unlike previous studies, group effects for CMR were not found due to high heterogeneity. Increased task-based activation, including in the lateral occipital cortex involved in visual body perception and spatial awareness, was seen for the exercise group (n=5). Discussion: These preliminary results support the potential of remotely delivered CMR and exercise to improve function and reduce neuropathic pain in adults with SCI, highlighting the need for larger trials. Clinicaltrial.gov: NCT05870189

19
Immunologically Optimized Zmp1 Peptides Reveal a Translational Serological Biomarker Platform for Tuberculosis Diagnosis Across Disease Manifestations

Zade, O. S.; Yandrapally, S.; Choudhari, K.; Gaikwad, A. V.; Panda, R.; Neela, V. S. K.; Devalraju, K. P.; Eedara, R. V. V.; Ansari, M. S.; Chandrashekhar, C.; Sriram, D.; Mohareer, K.; Valluri, V. L.; Somvanshi, P. R.; Banerjee, S.

2026-06-12 infectious diseases 10.64898/2026.06.11.26355355 medRxiv
Top 13%
1.4%
Show abstract

Tuberculosis (TB) diagnosis remains challenging, particularly for extrapulmonary TB (EPTB), where invasive sampling, low bacillary burden, and suboptimal sensitivity of nucleic acid-based tests in peripheral specimens hinder timely detection. Here, we report an immunology-driven strategy for biomarker discovery and development of a peptide-based serological assay targeting Mycobacterium tuberculosis zinc metalloprotease-1 (Zmp1). Leveraging fundamental principles of adaptive immunity that antigenic regions containing overlapping B-cell and CD4 T-helper cell epitopes would preferentially generate high antibody titers through linked recognition and cognate T-cell help, we used an immunoinformatics pipeline to identify two nested immunodominant peptide regions within Zmp1 (Mtb-Zp-NT and Mtb-Zp-CT) enriched for overlapping B- and T-cell epitopes. The diagnostic potential of these peptides was evaluated through ELISA-based serological assays. A blinded pilot study (N=137) demonstrated a clear discrimination between active TB and TB-recovered individuals. The assay was subsequently validated in an expanded cohort (N=875) by screening 6,086 individuals, which identified 457 TB-positive cases. The cohort included pulmonary TB (PTB), EPTB, TB-recovered individuals, household contacts, non-specific infections, and healthy controls. Receiver operating characteristic analyses, supported by DeLong and bootstrap comparisons, revealed superior diagnostic performance of the peptide-based assays relative to full-length Zmp1. Mtb-Zp-CT exhibited the highest accuracy (AUC=0.93; specificity >90%), while Mtb-Zp-NT also demonstrated strong discriminatory power (AUC{approx}0.89). These findings establish that the immunologically optimized Zmp1 peptides are highly promising serological biomarkers for TB and EPTB. More broadly, they demonstrate how mechanistically informed epitope selection can accelerate translation of pathogen-specific immune signatures into sensitive, minimally invasive, and potentially point-of-care diagnostic platforms for resource-limited settings.

20
PCRAgent: A Multi-Agent Framework for Transforming Noisy clinical conversations into Structured Pre-Consultation Medical Records and Reusable Clinical Data Resources

Zhang, M.; Zhao, J.; Tang, W.; Xing, J.; Li, J.; Zhang, H.; Qiu, J.; Zhang, Y.

2026-06-11 health informatics 10.64898/2026.06.10.26355372 medRxiv
Top 13%
1.4%
Show abstract

In primary care and outpatient settings, clinically important patient information is often embedded in fragmented, ambiguous, repetitive, and noisy communication between physicians and patients. This limits physicians ability to obtain a clear preconsultation overview of symptoms, history of present illness, and visit intent, while also preventing real world clinical dialogues from being reused in hospital information systems and medical artificial intelligence applications. To address this challenge, we developed PCRAgent, a centrally coordinated multi agent framework for preconsultation clinical information organization. Guided by physician inquiry logic, PCRAgent identifies, extracts, corrects, and standardizes patient-reported information from noisy consultations. Its coordinated modules including error detection, semantic editing, output control, contextual memory, and intent recognition enable robust parallel handling of spelling errors, repetitions, grammatical inconsistencies, medical ambiguities, and non-medical interference. A traceable edit list records intermediate corrections and context, allowing iterative refinement without redundant modifications. PCRAgent generates two complementary outputs. One is a PreConsultation Clinical Report for rapid physician review. The other is a Structured Clinical Conversation Dataset for hospital data construction and downstream AI applications. In evaluations using 220000 strongly perturbed consultations, PCRAgent maintained high robustness, achieving a clinical information accuracy of 4.99 out of 5 and key element completeness of 5 out of 5, outperforming GPT4o. Expert review of Chinese and English dialogues confirmed high clinical accuracy of 4.85 out of 5 and high safety of 4.79 out of 5. Multicenter validation in real-world outpatient workflows further demonstrated practical utility. These findings indicate that PCRAgent can efficiently transform noisy and unstructured consultations into physician ready reports and AI ready structured data, improving outpatient efficiency, reducing cognitive burden, ensuring information completeness, supporting precise decision-making, and enabling high-quality reuse of clinical data.